Drop '_pg_collection' in MBridge model config when checkpointing#813
Drop '_pg_collection' in MBridge model config when checkpointing#813
Conversation
Signed-off-by: Asha Anoosheh <aanoosheh@nvidia.com>
📝 WalkthroughWalkthroughThe change modifies configuration key filtering logic in transformer config parsing. It replaces substring-based filtering of keys containing "tp_" with prefix-based filtering for keys starting with "tp_". This alters which config entries are preserved during processing. Changes
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~3 minutes 🚥 Pre-merge checks | ✅ 1 | ❌ 2❌ Failed checks (2 warnings)
✅ Passed checks (1 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing touches
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #813 +/- ##
=======================================
Coverage 74.17% 74.17%
=======================================
Files 192 192
Lines 19246 19246
=======================================
Hits 14276 14276
Misses 4970 4970 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
What does this PR do?
Type of change: Bug Fix
Overview: Bug report appeared where this was an issue, since we save our own
modelopt_run_config.yamlwhich has repr() issues. The main fix is happening in MCore'sProcessGroupCollection.__repr__, but this is a backup measure since we don't want to save it anyway, regardless. Megatron-LM does not have this issue since only M-Bridge's Model Provider classes extendTransformerConfigand store an extra_pg_collectionattribute.Usage
Testing
Before your PR is "Ready for review"
Additional Information
Summary by CodeRabbit
✏️ Tip: You can customize this high-level summary in your review settings.